Detailed analysis of an enriched deep intronic ABCA4 variant in Irish Stargardt disease patients

Over 15% of probands in a large cohort of more than 1500 inherited retinal degeneration patients present with a clinical diagnosis of Stargardt disease (STGD1), a recessive form of macular dystrophy caused by biallelic variants in the ABCA4 gene. Participants were clinically examined and underwent either target capture sequencing of the exons and some pathogenic intronic regions of ABCA4, sequencing of the entire ABCA4 gene or whole genome sequencing. ABCA4 c.4539 + 2028C > T, p.[= ,Arg1514Leufs*36] is a pathogenic deep intronic variant that results in a retina-specific 345-nucleotide pseudoexon inclusion. Through analysis of the Irish STGD1 cohort, 25 individuals across 18 pedigrees harbour ABCA4 c.4539 + 2028C > T and another pathogenic variant. This includes, to the best of our knowledge, the only two homozygous patients identified to date. This provides important evidence of variant pathogenicity for this deep intronic variant, highlighting the value of homozygotes for variant interpretation. 15 other heterozygous incidents of this variant in patients have been reported globally, indicating significant enrichment in the Irish population. We provide detailed genetic and clinical characterization of these patients, illustrating that ABCA4 c.4539 + 2028C > T is a variant of mild to intermediate severity. These results have important implications for unresolved STGD1 patients globally with approximately 10% of the population in some western countries claiming Irish heritage. This study exemplifies that detection and characterization of founder variants is a diagnostic imperative.

www.nature.com/scientificreports/ The vast phenotypic spectrum encompassed by STGD1 is caused by the high level of allelic heterogeneity observed in ABCA4 and the wide range of effects of variants. The ClinVar database currently lists 651 individual variants with an associated severity rating of likely pathogenic or pathogenic within the ABCA4 gene 13 . A metaanalysis has listed over 2200 incidents of pathogenic ABCA4 variants to date which are publicly available at www. lovd. nl/ ABCA4 14 . The combination of variants appears to have an impact on the clinical presentation of the patient 15,16 . Disease can range from early onset pan-retinal dystrophy as a result of severe null or null-like variants to onset in adolescence or even later in life due to combinations of severe variants with variants of intermediate severity and milder variants with incomplete penetrance such as ABCA4 c.5603 T > C, p.Asn1868Ile 17,18 . Genotype-phenotype correlations for pathogenic ABCA4 variants have yet to be fully elucidated, as have correlations for potential disease modifiers 19 .
As ABCA4 is a large gene, comprising 128,315 nucleotides, traditional methods of screening have primarily utilized an exon focused sequencing approach. The aim of such studies is to identify disease causing protein coding variants. However, these methods do not ascertain the cause of disease in all cases [20][21][22][23] . This highlights the need to broaden the sequencing approach to include non-coding regions of ABCA4 in order to fully understand the landscape of ABCA4 related disease 24 . Studies that employ this methodology have identified several novel non-coding pathogenic ABCA4 variants [25][26][27][28][29][30][31] . Variants detected in the non-canonical splice site regions can lead to exon skipping or perturbation of the natural length of the exon 32 . In addition, many non-coding variants are located deep within introns and can result in the incorporation of a pseudoexon into the mature mRNA transcript due to the recruitment of splicing machinery to cryptic splice sites 24,33 . These pseudoexons often disrupt the reading frame or contain premature stop codons, potentially causing the transcript to undergo nonsense mediated decay 34 .
The aberrant splicing patterns identified as a result of intronic variants often do not occur in all transcripts, with a degree of wild-type mature mRNA produced in some incidents 24 . This can depend on the level of similarity of the model utilized for functional analyses to the retina 25,35,36 . Thus, considerable attention should be paid to both the model and the ratio of wild type to mutant transcript observed in order to understand disease manifestation and variant severity. However, it is imperative that follow up clinical studies are carried out to ascertain the true contribution of these variants to disease. Examples include ABCA4 c.5461-10 T > C which has been classified as severe based functional studies using patient derived fibroblasts 37 and photoreceptor precursor cells (PPCs) as well as the clinical presentation of four homozygous individuals 38 . Matynia et al. utilised patient induced pluripotent stem cell derived RPE cell models to investigate 4 genetically unresolved STGD1 cases. Subsequently, they identified the cause of disease in 2 of these patients and classified their severity using transcriptomics 39 . Additionally, ABCA4 c.5196 + 1137G > A was examined in keratinocytes derived from patients heterozygous for this variant resulting in a 73 nucleotide pseudoexon inclusion 33 . It has since been examined in patient-derived PPCs and has been categorized as a variant of intermediate severity following detailed genotype-phenotype analysis in a substantial cohort of patients 36 .
A landmark study on ABCA4 recently provided the most comprehensive analysis of genetic variation in the entire ABCA4 gene to date 24 . In this study, seven patients harboured ABCA4 c.4539 + 2028C > T, p.[= ,Arg1514Leufs*36] of which, five were patients recruited from Ireland. In the current study, through retrospective and continued analyses of the Irish STGD1 cohort, a total of 25 individuals have been identified to date, including two individuals who carry this variant homozygously. This, in conjunction with an evaluation of population controls from Ireland, suggests significant enrichment of this variant in the Target 5000 STGD1 cohort. ABCA4 c.4539 + 2028C > T was first reported by Braun et al. 33 , with functional analyses carried out by Albert et al. 25 . RNA extracted from patient-derived PPCs was examined with a 345-nucleotide pseudoexon inclusion observed in ~ 15% of all transcripts and ~ 30% of transcripts from this allele. A comprehensive clinical characterization of a group of patients with ABCA4 c.4539 + 2028C > T is provided. Impact of the variant on disease severity is evaluated through assessment of the phenotype of homozygous patients versus those who carry the variant in trans with a null or null-like variant. In addition, we phenotypically compare these individuals to those who carry ABCA4 variants of known severity in trans with a null or null-like variant 15 .

ABCA4 c.4539 + 2028C > T is enriched in the Irish STGD1 population. 27 alleles of ABCA4
c.4539 + 2028C > T have now been identified in the ABCA4 sequenced Irish STGD1 population, making this variant the fourth most frequently detected variant in the genetically resolved Irish STGD1 cohort (Table S2). Sex and genotype information for these patients is provided in Table 1. This includes two homozygous individuals, resulting in a total of 25 patients carrying ABCA4 c.4539 + 2028C > T in Ireland to date. A total of 15 other incidents of the variant have been identified across 5 STGD1 studies in 3,940 alleles tested globally 24,30,33,40,41 (Fig. 1). ABCA4 c.4539 + 2028C > T has a population frequency of 0.00003943 in gnomAD (v3.1) 42 , totaling 6/152,182 alleles. A population frequency of 0.000044 was observed in a recent study of 5579 bi-allelic STGD1 patients 43 . Of the 27 alleles of ABCA4 c.4539 + 2028C > T, 24 of these had the variant ABCA4 c.302 + 68C > T detected concurrently. ABCA4 c.302 + 68C > T has an allele frequency of 0.00004598 in gnomAD (v3.1) 42 . The complex allele consisting of these two variants has been previously reported several times 30,33,44 . We could not determine the length of the shared haplotype between individuals in this study as it is limited by the nature of sequencing that has been performed on these patients. However, supplemental Fig. 1 displays variation identified across the ABCA4 gene in all individuals who have ABCA4 c.4539 + 2028C > T and ABCA4 whole gene sequencing. This illustrates that both homozygotes share extremely similar alleles, and it may be possible that this allele is also present in the individuals who carry this variant in a heterozygous manner. Additionally, the numbers of carriers of the complex allele and the variant of interest only in this study were not sufficient to determine if the presence of the second variant had an additional effect on the severity of the phenotype. In Clinical characterisation of individuals carrying ABCA4 c.4539 + 2028C > T. Detailed phenotypic data including visual acuity, fundus autofluorescence (FAF), optical coherence tomography (OCT), electroretinography (ERG) and age at onset was collected from all 25 patients harbouring ABCA4 c.4539 + 2028C > T, where available, including two homozygous individuals. The median age at last examination was 38 (range =  years. In order to determine the severity of ABCA4 c.4539 + 2028C > T, two indivduals who carry this variant in a homozygous manner were compared to four individuals who carry this variant in trans with a null or null-like variant. The phenotype of these four individuals allows surveyance of the effect of ABCA4 c.4539 + 2028C > T without residual function from the other allele, while the clinical presentation of the homozygous individuals highlights the direct phenotypic outcome of ABCA4 c.4539 + 2028C > T without the presence of any additional pathogenic ABCA4 variants. The individuals who carry this variant in trans with a null or null-like variant were also compared to individuals who carry variants of known severity in trans with a null or null-like variant as described by Fakin et al. 15 . Details of these variants are included in the methods section. These comparisons were performed in order to gauge the pathogenicity of ABCA4 c.4539 + 2028C > T on the STGD1 severity spectrum. Visual acuity. The visual acuity of all individuals harbouring ABCA4 c.4539 + 2028C > T, where available, can be found in Table 2. The median visual acuity of the homozygotes was 6/33.5 at a median age of 47.5, with the female having visual acuity of 6/48 at age 58 and the male having a visual acuity of 6/19 at age 37. By comparison, the median visual acuity of those who harboured ABCA4 c.4539 + 2028C > T in trans with a null or null-like variant was 6/48 (range = 6/36-60, n = 4) at a median age of 36.5 (range = 16-41, n = 4). In addition, the median visual acuity of those who carried mild variants in trans with a null or null-like variant was 6/60 (range = 6/7.5-60, n = 9) at a median age of 33 (range = 17-50, n = 9). The median visual of those who carried intermediate variants   Table 3.
The foveal photoreceptors were affected on optical coherence tomography (OCT) in all individuals, with none exhibiting foveal sparing (Fig. 2). One participant displayed only mild foveal deterioration, however this individual was 13 years old at the time of multimodal imaging and the severe manifestations of the phenotype may not yet be fully apparent at this age (  Table 4. The median CRT, OCT atrophy area, age and sample size for the c.4539 + 2028C > T homozygotes, patients who carried ABCA4 c.4539 + 2028C > T in trans with a null or null-like variant, patients who carried mild variants in trans with a null or null-like variant and patients who carried intermediate in trans with a null or null-like variant can be found in Table 5. The homozygous individuals have a less severely affected median CRT and OCT atrophy area at a later median age than all three other groups. The patients who carried ABCA4 c.4539 + 2028C > T in trans with a null or null-like variant had median CRT and OCT atrophy areas intermediate relative to that of patients who carried the mild and intermediate variants in trans with a null or null-like variant (Fig. 3). www.nature.com/scientificreports/ Electroretinography. Electroretinography (ERG) data was available for 14/25 participants (Table 6), all recruited from one institution and examined by the same clinician. The median age at ERG recording was 37 (range = 18-56). In accordance with a published ERG classification system 45  Age at onset. The median age at onset for those with ABCA4 c.4539 + 2028C > T was 15 (range = 6-49 years).
As age at onset data on ABCA4 variants of known severity was available through prior publications, these data were used in statistical analyses 15 . In order to assess the severity of ABCA4 c.4539 + 2028C > T, based on age at disease onset, 4 patients who carry this variant in trans with a null or null-like variant were compared with 12 previously reported double null/null-like individuals 15 , 28 individuals who carry mild variants in trans with a null or null-like variant (8 from the Target 5000 cohort, 20 previously reported 15 ) and 29 individuals who carry intermediate variants in trans with a null or null-like variant (5 from the Target 5000 cohort, 24 previously reported 15 ) (Fig. 6). Median ages of onset were compared across all groups using a Kruskal-Wallis ranked sum test (p = 0.0000008837), with post hoc analysis carried out using a pairwise Wilcoxon rank sum test, correcting for multiple testing using the Holm-Bonferroni method. Only one individual from each pedigree was included in the tests in order to control for factors such as shared environment, which may influence phenotypic outcome. A significant difference was observed between the median age at onset of those who carried ABCA4 c.4539 + 2028C > T in trans with a null or null-like variant and those who were carried two null/null-like variants (p = 0.0082). The median age at onset of those who carried ABCA4 c.4539 + 2028C > T in trans with a null or null-like variant was 14 (range = 12-17, n = 4), while the median age at onset of those who carried two null/nulllike variants was 6 (range = 4-10, n = 12). Those who carried a variant of intermediate severity in trans with a null or null-like variant had a median age of onset of 16 (range = 6-53, n = 28). The median age at onset for those who carried a mild variant in trans with a null or null-like variant was 25 (range = 11-56, n = 28). A significant difference was not observed between the median age at onset of those who carried ABCA4 c.4539 + 2028C > T in trans with a null or null-like variant and those who carried a mild or intermediate variant in trans with a null or null-like variant (p = 0.1042 and p = 0.6581 respectively) (Fig. 6). Comparison of a variant of interest found in trans with a null or null-like allele with individuals who carry two null/null-like variants has been used to interrogate variant severity previously 15,36 . However, as age at onset is subject to recall bias (the proband may not have

Discussion
In this study, we collated data on Irish participants with STGD1 who possess a pathogenic deep-intronic ABCA4 variant, c.4539 + 2028C > T representing the largest cohort of such patients identified globally to date. We clearly illustrate the enrichment of ABCA4 c.4539 + 2028C > T in the Irish STGD1 population, through analysis of this group and examination of the literature and databases for additional cases globally. A detailed genotype-phenotype analysis was performed suggesting that this deep intronic ABCA4 variant results in a phenotype between mild and intermediate severity. Homozygotes present during adulthood and display well-circumscribed, localized, profound macular atrophy with minimal fleck lesions. ABCA4 c.4539 + 2028C > T was first described by Braun et al. 33 in a study that described some of the first deepintronic variants in ABCA4. Functional analysis was performed by Albert et al. 25 . RNA was first extracted from patient fibroblast cell lines, but aberrantly spliced product was not detected. To further investigate retina-specific splicing defects, RNA extracted from patient derived PPCs was examined and a 345 nucleotide pseudoexon inclusion was observed in ~ 7.5% of all transcripts and ~ 15% of transcripts derived from the ABCA4 c.4539 + 2028C > T allele. This phenomenon, wherein a model that better recapitulates the patient displays a splice defect where one was not previously detected, has been noted previously 25,47 . The presence of ABCA4 c.4539 + 2028C > T is predicted to create one SC35 and two SRp40 motifs by ESE finder 3.0 48 . ABCA4 c.4539 + 2028C > T and ABCA4 c.4539 + 2001G > A were the first reports of a pseudoexon inclusion in inherited retinal disease due to the creation of novel exon splice enhancer motifs, as opposed to the direct creation of cryptic splice acceptor/donor sites 25 . The neighboring c.4539 + 2001G > A variant that also results in the 345-nt pseudo-exon insertion showed 75% wild-type transcript though genotype-phenotype correlations classified it as 'severe' . Given the nature of c.4539 + 2028C > T and additionally its enrichment in the Irish STGD1 patient cohort, it was of interest to explore the phenotypic consequences of this variant in patients.
All patients harbouring ABCA4 c.4539 + 2028C > T and a known disease-causing variant presented with phenotypes that fall on the spectrum of STGD1. No other pathogenic candidate variants were detected using a variety of sequencing procedures, such as target capture next-generation sequencing 49-51 , entire ABCA4 gene  www.nature.com/scientificreports/ sequencing 24 , and whole genome sequencing. All individuals exhibited FAF abnormalities, with this abnormality extending beyond the vascular arcades in 13/25 individuals. In general, a normal/mild peripheral phenotype was observed. However, severe atrophy of the central macula was detected in most cases. Four participants harboured ABCA4 c.4539 + 2028C > T in trans with a null or null-like variant and two participants were found to carry the variant in a homozygous manner. These individuals were phenotypically compared with those harbouring mild or intermediate variants in trans with null or null-like variants. Age at onset, ERG amplitudes as well as FAF and OCT patterns were used to assess variant severity, suggesting that the phenotype of the individuals who were homozygous for ABCA4 c.4539 + 2028C > T was most similar to those that had been described as intermediate previously 15 . Other lines of evidence support this hypothesis, for example, ABCA4 c.4539 + 2028C > T was not detected in trans with a variant previously categorized as mild. This may be because when in trans with a mild variant, the level of functional ABCA4 protein produced is sufficient to prevent significant loss of vision and thus individuals did not present for examination. As mentioned previously, functional analyses in PPCs indicated that residual wild-type transcript is produced in the presence of ABCA4 c.4539 + 2028C > T 25 . The resulting phenotype also greatly depends on the pathogenic variant found in trans, providing additional evidence of between mild  www.nature.com/scientificreports/ and intermediate severity. This is supported by the phenotype of both homozygous individuals who were older at disease onset, have a more well-preserved peripheral phenotype, better ERG amplitudes (ERG group 1: normal ffERG) and better visual acuity over time than those who carry the variant in trans with a null or null-like variant. It is of note that the disease observed in the two homozygotes is not widespread within the retina, but the fovea and perifoveal area is profoundly impacted in both.
Population-specific studies have provided a wealth of data describing the genomic architecture of IRDs in particular countries, identifying variants present at distinctly higher frequencies than other cohorts [52][53][54][55][56][57] . Similarly, it has become apparent that ABCA4 c.4539 + 2028C > T is enriched in the Irish STGD1 population. A total of 27 incidents of this variant have been identified in our cohort to date, by comparison with 15 incidents  www.nature.com/scientificreports/  www.nature.com/scientificreports/ of the variant identified across 5 studies in 3,940 alleles tested globally 24,30,33,40,41 . The Irish cohort includes, to the best of our knowledge, the only two patients homozygous for this variant identified to date. A complex allele containing ABCA4 c.4539 + 2028C > T and ABCA4 c.302 + 68C > T, is frequently observed, but not in all incidents. Among the 27 ABCA4 c.4539 + 2028C > T alleles detected in Irish STGD1 patients, 24 incidents of c.302 + 68C > T were also detected. The segregation of these two variants in a complex allele has been reported previously 30,33,44 and also accounts for the majority of Irish cases with c.4539 + 2028C > T. In our cohort, although ABCA4 c.4539 + 2028C > T could be observed both in the presence of ABCA4 c.302 + 68C > T or in isolation, ABCA4 c.302 + 68C > T was only observed as part of the complex allele. Importantly, for those with sequencing datasets that are limited to the exonic regions of ABCA4, c.302 + 68C > T may be captured due to the nature of target capture sequencing by hybridization and its proximity to exon 3. As ABCA4 c.302 + 68C > T has not been shown to have a qualitative effect on the resulting mRNA 25 , directly sequencing for ABCA4 c.4539 + 2028C > T, in cases where c.302 + 68C > T has been detected may aid in providing a more conclusive genetic diagnosis where one has not yet been determined. This approach adds value to existing exome or target panel data and may reduce the need for whole gene or whole genome sequencing to resolve partial STGD1 diagnoses, particularly in cases where the patient is of Irish descent. This is internationally relevant as approximately 10% of the population in some western countries, including the UK, US, Canada and Australia claim Irish heritage [58][59][60] . Pedigrees harbouring ABCA4 c.4539 + 2028C > T account for 14% of resolved STGD1 pedigrees in the Irish cohort to date. The identification of IRD patients homozygous for putative pathogenic variants is imperative, as these Mendelian conditions present with phenotypes that can be greatly attributed to that particular variant, as illustrated by other STGD1 studies 15,36 . This is particularly true of splice altering variants. Given their mostly intronic nature, pathogenicity can currently only be proven by in vitro and ex vivo models that recapitulate the patient context to a limited extent, as demonstrated by the lack of identifiable splice defect in patient derived fibroblasts harbouring ABCA4 c.4539 + 2028C > T and low percentage of mutant transcript in PPCs 25 . The participation of both homozygous patients in this study provides unique and invaluable insights into the phenotypic outcome of this variant, facilitating true variant categorisation and interpretation. In addition, as increasing numbers of homozygous patients are identified and if their phenotypes vary despite non-detection of other pathogenic variants, they may provide unique insights into potential modifier variants, ABCA4 c.4539 + 2028C > T appears to result in a small area of profound central macular atrophy without evidence of disease in the peripheral retina. Describing this genotype-phenotype correlation may enable more accurate clinical phenotyping, facilitating greater positive pre-test probability for diagnosis of this particular allele, particularly in unresolved mono-allelic cases where Irish ancestry is noted. However, as this study examines the effect of a rare variant on a rare disease in a small number of individuals, it must be noted that caution should be taken when attributing severity to individual alleles. There is variance between the individuals in each group and the numbers of individuals in each group. While cumulatively these patient groups represent by far the largest patient cohort globally for the ABCA4 c.4539 + 2028C > T variant, the numbers are still limited as expected for a rare disease. However, the study www.nature.com/scientificreports/ provides insights into the disease features associated with the ABCA4 c.4539 + 2028C > T variant. In addition, interrogation of this variant in ex vivo retinal organoids, three dimensional structures derived from patient pluripotent stem cells, may more accurately illustrate the level of mutant versus wild-type transcript produced. This may more accurately recapitulate the splice defect observed in the retina of patients. Moreover, while extremely heritable, inherited retinal diseases may not be strictly Mendelian at times. Modifier variants, environmental influences and other factors are likely to affect phenotypic outcome, with a recent study indicating that sex may play a modifying role in STGD1 17,19 .
A variety of therapies are under consideration for STGD1. Preclinical trials have employed lentiviral vectors, chosen based on their cargo capacity relative to the size of the ABCA4 gene, resulting in reduced levels of lipofuscin 61 . A phase I/II clinical trial of SAR422459 is investigating the safety and efficacy of subretinal injection and lentiviral-mediated delivery of ABCA4 62 . Additionally, dual adeno-associated virus (AAV) therapies for STGD1 are being considered 63,64 . Interestingly, the possibility of subretinal transplantation of human embryonic stem cell-derived retinal pigment epithelium cells (MA09-hRPE) is also under investigation for cases with more advanced macular/foveal atrophy. A phase I/II clinical trial has resulted in no adverse immune effects in 7/7 patients and 3 of these 7 reported increased visual acuity during a 12-month trial 65,66 . In addition, intronic variants such c.4539 + 2028C > T represent ideal candidates for antisense oligonucleotide (AON)-based therapy, employing short synthetic RNA molecules to restore correct splicing patterns. Promising pre-clinical results modulating the splice defect associated with ABCA4 c.4539 + 2028C > T have been obtained, rescuing 80% of the observed splice defect 25 . Extensive work has been carried out on the modulation of splice defects that cause other forms of IRD, namely on the USH2A 67 , CHM 68 and OPA1 69 genes. Studies are being carried out on a recurring deep-intronic variant (c.2991 + 1655A > G) in CEP290 that causes Leber congenital amaurosis 47,[70][71][72][73][74] . Like ABCA4 c.4539 + 2028C > T, this variant leads to pseudoexon incorporation in the mature mRNA transcript, and retinal-like cells (retinal organoids) showed a stronger pseudo-exon insertion then non-retinal cells 47 . AONs designed to correct this splice defect have displayed restoration of wild-type mRNA both in vitro and in vivo 75 . Furthermore, this therapeutic strategy has produced promising results in the first clinical trial using AONs for IRDs, with a phase II trial now underway 76 .
In this study, we provide detailed genetic and clinical characterization of 25 individuals with STGD1 in Ireland who carry ABCA4 c.4539 + 2028C > T, which we define as a having a macula-centred phenotype of between mild and intermediate severity. We also illustrate the enrichment of this variant in Ireland and describe a specific clinical phenotype in homozygotes as a result of this variant. Notably these findings represent important considerations for clinicians when making an accurate phenotypic diagnosis, as well as for geneticists in ascertaining a genetic diagnosis in unresolved STGD1 cases, particularly in those of Irish descent. This study also underscores the necessity of intronic as well as exonic sequencing in previously unresolved STGD1 cases. Given the widespread availability of DNA sequencing in this era of genomic medicine, it is imperative that patients' interests are at the forefront, with access to novel therapies dependent on a genetic diagnosis. With this wealth of genetic information comes the responsibility to try to establish accurate associations between genotypes and phenotypes. The current study aids this important objective with respect to STGD1. The findings should serve to facilitate additional harmonized clinical and genetic diagnoses as well as improving both clinician and patient understanding of disease progression and management in STGD1 patients who carry ABCA4 c.4539 + 2028C > T.

Participants. Participants were recruited through The Royal Victoria Eye and Ear Hospital and The Mater
Misericordiae University Hospital, Dublin as part of the Target 5000 study 77 . The participant cohort consisted of 25 individuals across 18 pedigrees clinically diagnosed with STGD and found to possess at least one allele harbouring the ABCA4 c.4539 + 2028C > T variant. 13 unaffected relatives were also recruited in order to determine the phase of variants identified. In addition, 15 affected participants carrying two ABCA4 variants of previously categorised severity and null/null-like variants 15 were recruited from Target 5000 and their phenotypic data used for comparative purposes to determine the severity the ABCA4 c.4539 + 2028C > T variant. Previously published data on age at onset from 56 affected individuals carrying variants of known severity was also included in this study. Of the 25 participants with ABCA4 c.4539 + 2028C > T, 13 were female while 12 were male, with a median age at last examination of 38 years (range: 13-70).
Clinical assessment. All study participants had a clinical diagnosis of either STGD1 or cone-rod dystrophy. Informed consent was obtained from each participant and the appropriate assessments were performed in order to determine the patient's phenotype. Clinical data collected and analysed included sex, age at last examination, age at onset of disease, visual acuity (Revised 2000 early treatment diabetic retinopathy study (ETDRS) charts, Precision Vision, La Salle, IL, USA), visual fields (Goldmann perimeter 940 (iv4e, i4e and 04e targets), Haag-Streit AG, Köniz, Switzerland), colour vision (Lanthony desaturated D-15 panel under standardised lighting conditions, Gulden Ophthalmics, Elkins Park, PA, USA), dilated slit lamp biomicroscopy, colour fundus photography and fundus autofluorescence (FAF) (Topcon CRC50DX/Optos Daytona, Topcon Great Britain Ltd., Berkshire, England/Optos plc, Dunfermline, Scotland), optical coherence tomography (OCT) (Cirrus 5000, Carl Zeiss Meditec, Berlin, Germany), and electroretinography (ERG) Roland Consult RETI-port retiscan, Brandenburg an der Havel, Germany), where available. The least affected reading was used for data illustration purposes in all cases.
Sequencing procedures. 13 affected individuals with ABCA4 c.4539 + 2028C > T underwent target capture next generation sequencing of the exons and known pathogenic intronic regions of ABCA4 as described previously 49 Variant categorisation. Variants designated "mild", "intermediate" or "null-like" were based on a previous comprehensive study by Fakin et al. 15 . Statistical analyses and data visualisation. Statistical analyses and data visualisation were performed using R version 4.0.3 (R Core Team (2020). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL https:// www.R-proje ct. org/). The median age at onset of those harbouring ABCA4 c.4539 + 2028C > T were compared with the age at onset of individuals who carried two null/null-like variants, those who carried mild variants in trans with a null or null-like variants and those who carried intermediate variants in trans with a null or null-like variants previously reported 15 using a Kruskal-Wallis ranked sum test with post hoc analysis carried out using a pairwise Wilcoxon rank sum test, correcting for multiple testing using the Holm-Bonferroni method. Only one affected individual from each pedigree was included in these analyses in order to exclude other factors that may influence the phenotypic outcome, such as shared environment.
Editorial policies and ethical considerations. Prior  www.nature.com/scientificreports/